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A system and method for broadcasting data in accordance 
with predicted usage. In accordance with the method, the 
predicted usage of data records within a given set is deter- 
mined. The broacast order is then determined based on the 
predicted usage and the data records are broadcast in the 
order determined. Two kinds of embodiments are consid- 
ered: one in which variant index fanouts are not allowed 
(i.e., fixed fanout has to be used), and the other in which 
variant index fanouts are allowed. For the case of fixed index 
fanouts, a first method for the optimal index tree construc- 
tion minimizes the average cost of index probes. For the case 
of variant index fanouts, a second method (method 2) builds 
index trees with variant fanouts. The first method uses 
access frequencies of data records to build a fixed fanout 
index tree. In the second method, the number of fanouts of 
each index is determined as a function of the access fre- 
quencies of those nodes (data or indexes) that the index node 
points to. 
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INDEX ALLOCATION FOR DATA 
BROADCASTING 

I. BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to broadcasting data in a 
wireless mobile computing environment. 

2. Related Art 

Mobile computing, referring to the activity of using 
personal digital assistants, such as pahntops and notebook 
computers, to access a large number of databases via wire- 
less networks, is an emerging technology. A significant 
amount of research effort has been directed towards explor- 
ing various aspects of mobile computing, such as energy 
saving, cache management, query processing and data allo- 
cation. Applications such as using palmtops to access airline 
schedules, stock activities, u-affic conditions and weather 
information on the road are expected to become increasingly 
popular. It is noted, however, that several mobile computers, 
such as desktops and palmtops, use small batteries for their 
operations and are not directly connected to any power 
source. As a result, energy saving is a key issue to resolve 
before one can anticipate an even wider acceptance of 
mobile computers. 

In general, a mobile server is expected to concurrently 
serve hundreds or even thousands of clients. In large part, 
because the cost of broadcasting is independent of the 
number of users, periodic broadcasting is the prevalent 
method for information dissemination in a wireless commu- 
nication environment. When a palmtop is actively listening 
to a channel, its CPU is in an active mode to examine data 
packets so as to determine if they match the predefined 
patterns. Such CPU operations for data examination con- 
stune much more power than those for simply receiving 
data. Therefore, to achieve energy savings it is highly 
desirable to let the palmtop stay in the doze mode most of 
the time and only enter the active mode when it is necessary. 
As a consequence, it is advantageous to use indexed data 
organization to broadcast data over wireless channels so that 
those mobile units can be guided to the data of interest 
cfiBciently and only need to be actively listening to the 
broadcasting channel when the relevant information 
(indexes or data) is present. The structure of an index tree 
determines the index probing scenario for data access under 
such an indexed broadcasting. 

A conventional index tree is given in FIG. la, and its 
corresponding broadcasting sequence is shown in FIG. lb. 
Suppose that record R5 is the record to be accessed. Then, 
after being routed to the root index, I, the request will probe 
I and a^, and then reach R5. Clearly, using this index tree, a 
request to any record will take two index probes. Note that 
the number of index probes is proportional to the amount of 
time that a mobile unit has to stay in the active mode. 

Typically, conventional wireless servers provide a sym- 
meUic balanced index tree with essentially the same fanouts 
for all index nodes. One possible reason for this is that in a 
conventional file system (one that is not used in conjunction 
with wireless mobile computing) the cost of executing an 
index probe for data access is almost negligible as compared 
to other CPU and I/O operations. Thus little overall perfor- 
mance improvement is obtained by minimizing the number 
of index probes. Such a cost model, however, does not hold 
in the context of wireless mobile computing, in which the 
cost for a mobile unit to listen to the indexed broadcasting 
message is one of the major sources for power consumption. 
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Hence, reducing the average number of index probes will 
result in a lower power consumption. 

It is noted that in most databases the access frequencies of 
different data records are usually different horn one another. 
5 This phenomenon is termed data access skew. 

II. SUMMARY OF THE INVENTION 

In light of the above, the present invention provides a 
system and method for broadcasting data in accordance with 
10 predicted usage. In accordance with the method, the pre- 
dicted usage of data records within a given set is determined. 
The broacast order is then determined based on the predicted 
usage and the data records are broadcast in the order 
determined. 

In the preferred embodiments, two kinds of systems are 
considered: one in which variant index fanouts are not 
allowed (i.e., fixed fanout has to be used), and the other in 
which variant index fanouts are allowed. For the case of 
fixed index fanouts, a first method (method 1) for the 
optimal index tree construction that minimizes the average 
cost of index probes is provided. For the case of variant 
index fanouts, a second method (method 2) for building 
index trees with variant fanouts is provided. 

The first method uses access frequencies of data records 
to build a fixed fanout index tree. Exploiting data access 
skew, the index tree built by the method could be imbal- 
anced. Thus, in cases where variant fanouts are allowed the 
second method is preferred. In the second method, the 
number of fanouts of each index is determined as a function 
of the access frequencies of those nodes (data or indexes) 
that the index node points to. 

After an index tree is built, the mobile unit can locate the 
data of interest, according to the index tree, with the minimal 
35 probing cost. It is noted that the average cost of index probes 
can be significantly reduced not only by employing an 
imbalanced index tree that is designed in accordance with 
data access skew, but also by exploiting variant fanouts for 
index nodes. 

40 These, and other features and advantages of this invention 
will become apparent from the following detailed descrip- 
tion of the invention taken in conjunction with the accom- 
panying drawings. 

III. BRIEF DESCRIPTION OF THE DRAWINGS 

45 

FIG. Ifl is an illustration of the conventional index tree for 
use in broadcasting by a mobil server; 

FIG. lb shows a conventional index probing scenario; 

FIG. 2 illustrates a wireless mobile computing environ- 
^0 ment including a mobil server which broadcasts data in 
accordance with an embodiment of the present invention; 

FIG. 3 shows a flowchart of the broadcast method 
employed in the mobile server of FIG. 3. 

FIG. 4fl illustrates an imbalanced index tree constructed 
in accordance with an embodiment of the present invention; 

FIG. 4b shows a data broadcasting sequence correspond- 
ing to the index tree of FIG. 4a. 

FIG. 5 shows the process of grouping a set of nodes and 
60 moving them to a lower level to reduce the average index 
probe cost in accordance with an embodiment of the present 
invention; 

FIGS. 6a-6d show an iUustrative example for a variant 
fanout method (VF) according to an embodiment of the 
65 present invention; 

FIG. 7 shows a resulting index tree with variant fanouts 
from the variant fanout method of FIG. 6; 
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FIG. 8 is a flowchart of how the data in the database is In step 330, the data is organized according to the index 

broadcast by the mobil server of FIG. 2; tree built in step 320. In particular, the data is organized such 

RG. 9 shows the access probabUity for each data record that every index node of the index tree contains aU of the 

in an example database; pointers needed to access its child nodes. 

FIG. 10 shows the access probability for each data record ^ After the data has been organized, in step 340 the database 

in another example database; broadcast according to the index tree such that nodes in 

FIG. 11 is a flow chart of a fixed fanout method for '"'8'?=' levels (closer to the top of the tree) are broadcast 

constrm:tiiig an index tree according to an embodiment of '"^^^ ^""^ """^ 

the present invention; and, Method 1, which handles a constant fanout, will now be 

no. 12 is a flow chart of the variable fanout method for " ""J?^* '^"^?^}° FIG^ll. In essence, 

constructing an index tr«e according to an embodiment of ^« reduces the number of index probes for hot (i e., 

the presen7invention. ^^^"f^ accessed) data whUe aUowing more probes for 

„^ . „ . o , . . , ^r^^ cold (less frequently accessed) data. In the preferred 

HG. 13 is a flow chart of the PartiUon procedure of FIG. embodiment, this method is implemented by way of pro- 

gram instructions executing on a general purpose computer. 

IV. DETAILED DESCRIPTION OF A In the description below, n is the number of total data 

PREFERRED EMBODIMENT records and d is the fanout of each index. The number of data 

records is denoted as n, and each data record is denoted as 

FIG. 2 shows a wireless mobile computing system in R.,whercl^i^o. Let Pr(R..) be the access probabiUty of R,-, 

accordance with an embodiment of the present invention. A i e., the sum of all Pr(R;) for l^i^n. Also, a,, is used to 

mobile server 100 includes a database system 102 (which represent an index node in an index tree and d(a,) represents 

includes a collection of data records to be broadcast and data the fanout of a,. Path(R;) is the set of index nodes from the 

processing logic for maintaining and accessing the data root to data record R^. Referring to FIG. la, as an example, 

records), data access statistics 103 (which can be stored, for d(ai)=3 signifying that node al has a fanout of three records 

example, in semiconductor or disk memory), and a broad- (ri^ r2 and R3). Path(R3)={I, a^} in FIG. la signifying that 

casting controller 104. The broadcasting controller 104 the path to get to record R3, traverses the index node (I) and 

includes an index organizer 104^ which utilizes the access then node al. 

statistics 103 to organize the data records to be retrieved j^gt^od 1 uses the access frequencies to build an index 

from the database 102 for broadcasting. Jh^ index organizer ^ ^j^i, ^ fi^^j ^n^^j ^ ^^^^ ^^e method initializes 

can be embodied as program code m accordance with FIGS. ^ ^^^^j ^ subtrees, each of which is a single node labeled 

3-13. The mobU server 100 broadcasts selected data records ^jj, corresponding access frequency. Next, in step 1104 

from the database by way of a radio transceiver 105. The ^ g^^trees with the smallest labels are attached to a new 

mobUe server 100 also receives messages (e.g. requests) QQ^jg resulting subtree is labeled with the sum of all 

from mobile computing units (mobil units) 105-109 by way i^^eis from its d chUd subtrees, n is then decremented by the 

of the radio transceiver 105. The messages are processed by ^^^^ f^j,out d and the result is increased by l(n-(n-d)+l), 

the broadcast controller which, in turn, determines the Then, n is the number of remaining subtrees. In step 1108 it 

access statistics 103 for the dau records and stores them in ^3 determined if ther« are additional substrees remaining 

the memory. The mobile units can be m doze mode, saving (^^j) ^^^^ ^^^^^^ ^xits in step UO. Otherwise the next 

power consumption, such as the ones marked 106 & 109. or g^^trees is initialized in step 112 and the method is 
be in active mode, receiving data from the transceiver, such ^ repeated from step 1104. 

as the ones marked 107 & 108. Those of skill in the art will „ 1 • ci • n j j 1 .u j 

. -11 1 • 1 J For example, given the profile m FIG. 9 and do3, method 

recognize that the mobile server will also mclude a number . n l -i j • l 1 j ■ j • 

- X 1 . . L * 1 will build an imbalanccd index tree m a bottom up manner 

of other conventional system components, which are not the j^^^ imbalanced index tree with a fanout of three 

subject of the present mvention. /j • . j iv \ i. • a t r -r* . 
J f 45 (designated as V^^) ^ shown m FIG. 4a. To facilitate our 

A conventional index tree is shown in FIG. la, and its presentation, the index tree in FIG. la is denoted by T^^ 3. 

corresponding broadcasting sequence from the mobile ^hc superindexes of T^^ 3 and 3 stand for "balanced" 

server is shown in FIG. lb. Suppose that a given record, R5, "imbalanced", respectively. Denote the cost of probing 

is to be accessed by a mobile unit. After being routed to the ^n index a, as f(a^. Then, the average cost of beating a 

root index, I, the mobae unit will stay in the doze mode ^^^^^^ p^^ing indexes can be expressed as: 
while the irrelevant information, al, Rl, R2 and R3, is being 

broadcasted, and go back to the active mode only when the £t£i£«Pr(R,)^ycp«iA<jr^f(ai) 

relevant information, a, and RS, is present. where Path(R..) is the set of index nodes from the roottodata 

FIG. 3 shows a flowchart of the broadcast operation of the record R -. 
mobile server 100, 55 without loss of generality fl[ai)-g(d(a,))-d(a.-) is used in 

First, in step 310. the mobile server collects the data our discussion. It can be verified from FIG. 9 that by using 

access frequencies. More specifically, the mobile server the imbalanced index tree l^j^, instead of T^rf.3, the 

monitors all requests from mobile units and stores the access average cost of an index probe for data access can be 

frequencies of all data items as the data access statistics 103 significantly reduced, thus reducing the average power con- 
(FIG. 2). The data access statistics arc stored such that every 50 sumption. A corresponding broadcasting sequence is shown 

data item is paired with its access probability. The access in FIG. 4b. Consequently, given a fixed index fanout. the 

probability is the probability thai the corresponding item is average cost of index probes is minimized by using the index 

accessed by an incoming request. tree constructed by method 1. 

Next, in step 320 the statistics collected in step 310 are Next, method 2, which uses variant index fanouts to 
used to build the index tree. The method used to build the 6S minimize the average cost of index probes is described. It is 

index tree can be either method 1 for a constant (fixed) noted that unlike the conventional file system where a 

fanout or method 2 for a variant fanout. simple data structure is usually preferable, the approach of 
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using variant index fanouts can be implemented in the 
indexed broadcasting environment to lead to performance 
improvement. 

it is observed that on one hand, we want data records to 
stay as close to the root as possible, which, on its own, 
suggests that all data records be attached to the root node. On 
the other hand, however, indexes with larger fanouts are in 
general more costly to probe, and thus imdesirable, particu- 
larly for those data records with high access frequencies. 
Method 2 strikes a compromise between these conflicting 
factors and minimizes the average cost of index probes. 

In the preferred embodiment, method 2 is greedy in nature 
and builds the index tree in a top down manner. Method 2 
starts with attaching all data records to the root node. Then, 
after some evaluation, the method groups nodes with small 
access frequencies and moves them to one level lower so as 
to minimize the average index probe cost 

FIG. 5 shows the scenario of grouping a set of nodes and 
moving them to a lower level. It can be seen from FIG. 5 that 
while the cost of index probes is increased for those nodes 
moved down to the next level (i.e., the index probe cost for 
each node among h^^^, hj-^2» • • • » ^ ^ increased from 
m to (i+l)+(m-i)=m+l), the index probe cost for the other 
nodes will be greatly reduced (i.e., the index probe cost for 
each node among hj, h2, • ■ - , and h^- is reduced Grom m to 
i+1), thereby resulting in an overall reduction on the average 
index probe cost. Method 2 contains a recursive procedure 
Partition to identify the group of nodes to be moved down- 
ward in each execution level. 

A flow diagram of method 2 is shown in FIG. 12. 

First, in step 1202 the records (R^, R^, . . . , RJ are sorted 
into an initial broadcast sequence according to descending 
order of access probability, Pr(R^-), where l^j^n. At the 
conclusion of the sorting the sequence will be organized 
such that Pr(Ry)lPr(Rjfc) if and only if j<k. 

Next, in step 1204 the Partition procedure is performed, 
Partition (Rj, R^, • • , RJ. 

Then, in step 1206 the resulting index tree is reported to 
the server. When the next broadcast in the periodic broadcast 
cycle is commenced, in step 1208 the records are broadcast 
in the order indicated by the index. 

The Partition procedure, shown in more detail in FIG, 13. 
In step 1302 the procedure starts with a configuration where 
ail nodes are attached to the root. In step 1304 it is deter- 
mined if the average index probe cost can be reduced by 
moving a set of nodes to the next level. 

in step 1306 the specific set of nodes whose movement 
would be beneficial are identified and moved to the next 
lower level of the index tree (further from the index node). 
If more than one set of nodes is identified (i.e. if there is 
more than one candidate set of nodes whose movements 
would be beneficial) the one with the maximal reduction on 
the index probe cost is chosen and moved. 

After the initial set of nodes has identified and moved to 
the next level, in step 1308 the Partition procedure is called 
recursively to evaluate each of the nodes by themselves to 
see if any further downward movement for some of them is 
necessary. 

In step 1310 procedure Partition is called again to see if 
any further downward movement for some nodes in the new 
list is necessary. This recursive call of the Partition proce- 
dure operates on the original level by replacing those nodes 
moved downward with a new index node (e.g., h^ in FIG, 5) 
and assigning that node with an access frequency equal to 
the aggregate frequency of its chUd nodes, 

Procedure Partition partitions the nodes recursively with 
the objective of minimizing the average index probe cost. 
The index tree is then constructed in a top down manner. 
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To described the partitioning method in more detail, in 
step 1302 the Partition procedure is called for a tree of nodes 
(hi, h2, . . . , hm). In step 1304, let y(i)=<m-i-l)2is.s, 
Pr(hp-2:,-^isy s„ Pr(hj). Determine i* such that y(i^)o 

5 maxv^{,-^ m-2){y(0}- if y(i*)=0 Ihcn return. Then, in step 
1306, attach nodes hi*+l, hi*+2, . . . , hm under a new index 
node hx in the index tree. In step 1308 recursively call the 
Partition procedure for the restructured tree (hi*+l, 
hi* +2, . . . , hm). 

In step 1310, insert hx into the ordered list (hi, h2, . . . , 
hi*) and relabel them as (hi, h2, . . . , hi*+l) according to 
descending order of Pr(hj), l^j^i*+l. The again recur- 
sively call the Partition procedure for the restructure tree 
(hi, h2, hi*+l). Then return to the calling program in step 

15 1312. 

Use the data profile in FIG. 10 for an example. The initial 
index tree configuration in shown in FIG. 6a, where all data 
records are attached to the root. Procedure Partition then 
determines the optimal group of nodes to be moved to the 
next level. It is obtained that i*»4, and therefore group nodes 
R5, R6, . . . , and Rll together and move them to the next 
level, resulting in the configuration shown in FIG. 6b. Nodes 
R5, R6, . . . , and Rll under the index node al are then 
partitioned recursively. Also, in the original level, nodes R5, 
R6, , . . , and Rll are now replaced with al which is assigned 
with an access frequency of 0.12. It next determines if a 
further partition for the new list of child nodes under the root 
(i.e., Rl, R2, R3, R4 and al) is necessary. Consequently, the 
resulting index tree is shown in FIG. 7. 

FIG. 8 is a flowchart showing, with a given index tree, 
how the data in the database in broadcast. In step 810 the 
initial broadcasting sequence SEQ is formed by data items 
sorted in descending order of their access probabilities, i.e. 
SEQ=(R1, R2. , . , Rn), where Pr(Rj)^Pr(Rk). Step 820 
2j selects an index node, say a, from the lowest level of the 
index tree T and removes the child nodes of al from T (i.e. 
al this becomes a leave node). In step 830 ai is inserted in 
SEQ and placed immediately in front of its child node with 
the largest access probability. Then, in step 840, steps 
^ 820-830 are repeated imtil T becomes a single node. In step 
850 the data is broadcasted. 

Now that the invention has been described by way of the 
preferred embodiment, various modifications and improve- 
ments wiU occur to those of skill in the art. Thus, it should 
be understood that the preferred embodiment has been 
provided as an example and not as a limitation. The scope of 
the invention is defined by the appended claims. 

What is claimed is: 

1. A method for broadcasting data records and indexes 
from a mobile server, comprising the steps of: 

determining access frequencies of al least a subset of the 
data records; 

building an index tree having index nodes with child 
nodes comprising at least one of a further index node 
55 and a data record, by: 

(a) attaching the subset of data records to a root node 
of the index tree; and 

(b) recursively identifying at least one group of nodes 
with small access frequencies and moving the al least 

50 one group downward to a lower level in the tree; 

dynamically determining a broadcast order of the data 
records based on changes in the structure of the index 
tree; and 

broadcasting the data records and the indexes based on the 
65 broadcast order. 

2, The method of claim 1, wherein access to a data record 
is associated with a number of index probes, and the number 
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of index probes for a data record with a higher access 
frequency is less than the number of index probes for a data 
record with a lower access frequency. 

3. The method of claim 1, wherein each index node 
contains pointers for accessing all of its child nodes. 

4. The method of claim 1, fUrther comprising the step of, 
after building the index tree, reporting the index tree to the 
mobile server, 

5. The method of claim 1, wherein the mobile server 
monitors requests for data records from mobile units. 

6. The method of claim 1, wherein the mobile server 
stores data records with corresponding access frequencies. 

7. The method of claim 1, wherein the number of child 
nodes per index node is fixed. 

8. The method of claim 1, wherein the index tree is 
imbalanced. 

9. A method for broadcasting data records and indexes 
from a mobile server, comprising the steps of: 

determining access frequencies of at least a subset of the 
data records; 

building an index tree with a variant fanout comprising 
nodes of one of indexes and data records based on the 
access frequencies of the data records, by: 

(a) attaching the set of data records to a root node of the 
index tree; and 

(b) recursively identifying at least one group of nodes 
with small access frequencies and moving the at least 
one group downward to a lower level in the tree; 

dynamically determining a broadcast order of the data 
records based on changes in the structure of the index 
tree; and 

broadcasting the data records and the indexes based on the 
broadcast order. 

10. The method of claim 9, wherein the number of fanouts 
from each index node is determined as a function of the 
access frequencies of the nodes to which the index node 
points. 
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11. The method of claim 9, wherein the step of building 
the index tree further comprises the step of determining 
whether an average index probe cost can be reduced by 
moving a set of nodes to a lower level in the tree. 

12. The method of claim 9, wherein the step of building 
the index tree further comprises the step of, after moving at 
least one group of nodes downward to a lower level: 

replacing the at least one group of nodes with a new index 
EK)de; and 

assigning the new index node with an access frequency 
equal to an aggregate frequency of the child nodes of 
the new index node. 

13. The method of claim 9, further comprising the step of, 
after building the index tree, reporting the index tree to the 
mobile server. 

14. A mobile computing broadcast server system, com- 
prising: 

means for determining access frequencies of at least a 

subset of data records; 
means for building an index tree having index nodes with 

child nodes comprising at least one of a further index 

node and a data record, wherein the index tree is built 

by: 

(a) attaching the subset of data records to a root node 
of the index tree; and 

(b) recursively identifying at least one group of nodes 
with small access frequencies and moving the at least 
one group downward to a lower level in the tree; 

means for dynamically determining a broadcast order of 
the data records based on changes in the structure of the 
index tree; and 

means for broadcasting the data records and the indexes 
in the broadcast order. 



05/27/2004, EAST Version: 1.4.1 



